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Abstract 



We consider time-space tradeoffs for exactly computing frequency moments and order statistics over 
sliding windows [ 16 1. Given an input of length 2n — 1, the task is to output the function of each window 
of length n, giving n outputs in total. Computations over sliding windows are related to direct sum 
problems except that inputs to instances almost completely overlap. 

• We show an average case and randomized time-space tradeoff lower bound of T ■ S G fi(n 2 ) for 
multi-way branching programs, and hence standard RAM and word-RAM models, to compute the 
number of distinct elements, Fq, in sliding windows over alphabet [n]. The same lower bound 
holds for computing the low-order bit of F and computing any frequency moment Fk for k ^ 1. 
We complement this lower bound with a T • S € 0(n 2 ) deterministic RAM algorithm for exactly 
computing Fk in sliding windows. 

• We show time-space separations between the complexity of sliding-window element distinctness 
and that of sliding-window F mod 2 computation. In particular for alphabet [n] there is a very 
simple errorless sliding-window algorithm for element distinctness that runs in 0(n) time on av- 
erage and uses 0(log n) space. 

• We show that any algorithm for a single element distinctness instance can be extended to an al- 
gorithm for the sliding-window version of element distinctness with at most a polylogarithmic 
increase in the time-space product. 

• Finally, we show that the sliding-window computation of order statistics such as the maximum 
and minimum can be computed with only a logarithmic increase in time, but that a T ■ S € f2(n 2 ) 
lower bound holds for sliding-window computation of order statistics such as the median, a nearly 
linear increase in time when space is small. 



'This work was done while visiting The University of Washington, Seattle, WA. 
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1 Introduction 



Direct sum questions in a computational model ask how the complexity of computing many instances of a 
function / on independent inputs increases as the number of instances grows. The ideal direct sum theorem 
shows that computing n independent instances of / requires an Q(n) factor increase in computing resources 
over computing a single instance of /. 

Valuable though direct sum theorems can be, they require an increase in the number of inputs equal to 
the number of instances and hence do not increase the complexity of the resulting function when measured 
relative to the input sizq^] We are interested in how the complexity of computing many copies of a function 
/ can grow when the inputs overlap so that the total size of the input is not much larger than the input size 
for a single function^] 

A particularly natural circumstance in which one would want to evaluate many instances of a function 
on overlapping inputs occurs in the context of time series analysis. For many functions computed over 
sequences of data elements or data updates, it is useful to know the value of the function on many different 
intervals or windows within the sequence, each representing the recent history of the data at a given instant. 
In the case that an answer for every new element of the sequence is required, such computations have been 
termed sliding-window computations for the associated functions [16]. 

We focus on the questions of when the sliding-window versions of problems increase their complexity, 
and under what circumstances one can prove significantly larger lower bounds for these sliding-window 
versions than can be shown for the original functions. Unlike an ordinary direct sum lower bound, a positive 
answer will yield a proportionately better lower bound relative to the input size. The complexity measure we 
use is the time required for a given amount of storage; i.e., we study time-space tradeoffs of these sliding- 
window problems. Given the general difficulty of proving lower bounds for single output functions, in 
addition to the goal of obtaining proportionately larger lower bounds, comparing the difficulty of computing 
sliding-window versions of functions / and g may be easier than comparing them directly. 

Many natural functions have previously been studied for sliding windows including entropy, finding 
frequent symbols, frequency moments and order statistics, which can be computed approximately in small 
space using randomization even in one-pass data stream algorithms flT^ I71l6l [T9ll20l[T4lfT3i . Approximation 
is required since exactly computing these values in this online model can easily be shown to require large 
space. The interested reader may find a more comprehensive list of sliding-windows results by following 
the references in [ 1 3 ] . 

We focus on many of these same statistical functions and consider them over inputs of length 2n — 1 
where the sliding-window task is to compute the function for each window of length n, giving n outputs in 
total. We write f mn to denote this sliding-window version of a function /. 

Our main results concern the computation of frequency moments and element distinctness over sliding 
windows. Frequency moment Fq is the number of distinct elements in the input. The element distinctness 
problem ED, determining whether all input elements are distinct, is the special case of testing whether Fq is 
equal to the number of inputs. ED is often considered the decision problem that best tracks the complexity 
of integer sorting, a problem for which we already know tight time-space tradeoff lower bounds fTTl [H on 
general sequential computation models like multi-way branching programs and RAMs, as well as matching 

'in contrast, so-called direct product theorems show that the success probability declines exponentially in n. 

2 Computing many copies of a function on overlapping inputs (selected via a combinatorial design) was used as the basis for the 
Nisan-Wigderson pseudorandom generator construction [22], though in that case the total input size is much larger than that of the 
original function. 
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comparison-based upper bounds ll23l . (As is usual, the input is assumed to be stored in read-only memory 
and the output in write-only memory and neither is counted towards the space used by any algorithm. The 
multi-way branching program model simulates standard RAM models that are unit-cost with respect to time 
and log-cost with respect to space. Therefore in discussing complexity, we measure space usage in bits 
rather than words.) 

We prove time-space lower bounds for computing the sliding-window version of any frequency moment 
Fk for k / 1. In particular, the time T and space S to compute Fc n must satisfy T ■ S G f2(w 2 ). (Fi 
is simply the size of the input, so computing its value is always trivial.) Moreover, we show that the same 
lower bound holds for computing just the parity of the number of distinct elements, Fq mod 2, in each 
window. The bounds are proved directly for multi-way branching programs which imply lower bounds for 
the standard RAM and word-RAM models, as well as for the data stream models discussed above. The best 
previous lower bounds for computing any of these sliding window problems are much smaller time-space 
tradeoff lower bounds that apply to the computation of a single instance of Fj~. In particular, for any k ^ 1, 
an input has distinct elements if and only if Fk = n, so these follow from previous lower bounds for ED. 
Ajtai [4] showed that any linear time solution for ED (and hence F^) must use linear space. No larger 
time lower bound is known unless the space S is n°^ l \ In that case, the best previous lower bound for 
computing ED (and hence F^) is a T G Q(ny / log(n/5) / log log(n/5)) lower bound shown in [10]. This 
is substantially smaller than our T ■ S G Q,(n 2 ) lower bound. 

We complement our lower bound with a comparison-based RAM algorithm for any Ff n which has 
T ■ S G 0{n 2 ), showing that this is nearly an asymptotically tight bound, since it provides a general RAM 
algorithm that runs in the same time complexity for any polynomial-sized input alphabej^] 

Our lower bounds for frequency moment computation hold for randomized algorithms even with small 
success probability 2~°( s ^ and for the average time and space used on inputs in which the values are inde- 
pendently and uniformly chosen from [n] . 

It is interesting to contrast our lower bounds for the sliding-window version of Fq mod 2 with those for 
the sliding-window version of ED. It is not hard to show that on average for integers independently and 
uniformly chosen from [n], ED can be solved with T ■ S G 0(n). This can be extended to an algorithm that 
has a similar T ■ S G (D(n) bound for ED m ' n on this input distribution. This formally proves a separation 
between the complexity of sliding-window Fq mod 2 and sliding-window ED. Interestingly, this separation 
is not known to exist for one window alone. 

In fact, we show that the similarity between the complexities of computing ED and ED mn on average 
also applies to the worst-case complexity of deterministic and randomized algorithms. We give a general 
reduction which shows that for any space bound S, by using space 5* £ 5 + (3(log 2 n), one can convert any 
algorithm A for ED running in time T into an algorithm A* that solves ED m ' a in time T* G 0(T log 2 n) 
or alternatively T* G 0(T log n) if T G VL{n 1+& ). That is, there is no sliding-window analogue of a direct 
sum result for ED. 

These results suggest that in the continuing search for strong lower complexity lower bounds, Fq mod 2 
may be a better choice as a difficult decision problem than ED. 

Finally, we discuss the problem of computing the t th order statistic in each window. For these problems 
we see the full range of relationships between the complexities of the original and sliding-window versions 
of the problems. In the case of t = n (maximum) or t = 1 (minimum) we show that computing these 
properties over sliding windows can be solved by a comparison based algorithm in 0(n log n) time and 
only 0(log n) bits of space and hence there is no sliding- windows analogue of a direct sum result for these 

3 As is usual, we use O to suppress polylogarithmic factors in n. 
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problems. In contrast, we show that a T • S G f2(n 2 ) lower bound holds when t = an for any fixed 
< a < 1. Even for algorithms that only use comparisons, the expected time for errorless randomized 
algorithms to find the median in a single window is T £ r2(nloglog s n) lPT5Tl and there is an errorless 
randomized algorithm that precisely matches this bound lfl5l . Hence for many values of S there is an 
approximate direct sum analogue for these sliding- window order statistics. 

Related work While sliding-windows versions of problems have been considered in the context of online 
and approximate computation, there is little research that has explicitly considered any such problems in the 
case of exact offline computation. One instance where a sliding-windows problem has been considered is a 
lower bound for generalized string matching due to Abrahamson Q. This lower bound implies that for any 
fixed string y G [n] n with n distinct values, H^ n requires T ■ S G J7(n 2 /logn) where decision problem 
H y (x) is 1 if and only if the Hamming distance between x and y is n. This bound is an fi(logn) factor 
smaller than our lower bound for sliding-window Fq mod 2. 

One of the main techniques to derive time-space tradeoffs for branching programs was introduced in 
lfTT1 by Borodin and Cook and was generalized to a number of other problems (e.g., ll26l l2l[3l[8l l2Tll24ll ). 
Our lower bounds draw on this method but require some additional work to adapt it to the case of computing 
frequency moments. 

In addition to lower bounds that apply to unrestricted models such as RAMs and general branching 
program models, some of the problems we consider have been considered in structured comparison-based 
models. Borodin et al. [ 12 ] gave a time-space tradeoff lower bound for computing ED (and hence any F k for 
k ^ 1) on comparison branching programs of T 2 -S G f2(ra 3 ) andsinceS" > log 2 n,T-S G Q.{n^l 2 ^f\ogn)). 
Yao [25 ] improved this to a near- optimal T-S G Q,{n 2 ~ e ^), where e(n) = b/(\v.n) 1 / 2 . Since our algorithm 
for computing F^ n is comparison-based, this lower bound is not far from matching our upper bound for the 
sliding-window version of Fj-. 

Finally, we note that previous research implies a separation between the complexities of ED and 
Fq mod 2 in the context of quantum query algorithms: ED has quantum query complexity 0(n 2 / 3 ) (lower 
bound in [lj and matching quantum query algorithm in 0). On other hand, the lower bounds in [9] imply 
that Fq mod 2 has quantum query complexity Q(n). 

Organization In the remainder of this section we more formally define the EE operator, the statistical 
functions we consider, and the multi-way branching program model. In Section [2] we present our lower 
bound for computing frequency moments Ff. and Fq mod 2 over sliding windows followed by a comparison- 
based algorithm that yields a nearly matching upper bound. In Section[3]we give our algorithms for element 
distinctness over sliding windows which show the separation between Fq mod 2 and ED and the fact that 
sliding windows do not signicantly increase the complexity of ED. Finally in Section [4] we give our upper 
and lower bounds for sliding-window computation of order statistics. 

Sliding windows Let D and R be two finite sets and / : D n — > R be a function over strings of length 
n. We define the operation SLIDING-WINDOW, denoted EE, that takes / and returns a function f m : 
jjn+t-l _j. jjt^ deflned by f m (x) = (/(ccj . . . Xj +n _i))* =1 . We concentrate on the case that t = n and 
apply the SLIDING-WINDOW operator to the functions F k , F k mod 2, ED, and O t , the t th order statistic. 
We will use the notation F^ (resp. /j ) to denote the k th frequency moment (resp. the frequency of 
symbol i) of the string in the window of length n starting at position j. 
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Frequency moments, element distinctness, and order statistics Let a = a\a<i . . . , a n be a string of n 
symbols from a linearly ordered set. We define the k th frequency moment of a, -Ffc(a), as Fk(a) = X^eD fi> 
where fi is the frequency (number of occurrences) of symbol i in the string a and D is the set of symbols that 
occur in a. Therefore, Fo(a) is the number of distinct symbols in a and F\(a) = \a\ for every string a. The 
element distinctness problem is a decision problem defined as: ED{a) = 1 if Fq(o) = \a\ and otherwise. 
We write ED n for the ED function restricted to inputs a with \a\ = n. The t th order statistic of a, Ot, is 
the t th smallest symbol in a. Therefore O n is the maximum of the symbols of a and 0[|] is the median. 

Branching programs Let D and R be finite sets and n and m be two positive integers. A D-way branch- 
ing program is a connected directed acyclic graph with special nodes: the source node and possibly many 
sink nodes, a set of n inputs and m outputs. Each non-sink node is labeled with an input index and every 
edge is labeled with a symbol from D, which corresponds to the value of the input indexed at the originating 
node. In order not to count the space required for outputs, as is standard, we assume that each edge can be 
labelled by some set of output assignments. For a directed path it in a branching program, we call the set of 
indices of symbols queried by ir the queries of it, denoted by and we denote the set of answers to those 
queries as A w (we drop the subscript tt when it is understood from the context). 

A branching program computes a function / : D n — > R m by starting at the source and then proceeding 
along the nodes of the graph by querying the inputs associated with each node and following the correspond- 
ing edges. In the special case that there is precisely one output, without loss of generality, any edge with this 
output may instead be assumed to be unlabelled and lead to a unique sink node associated with its output 
value. 

A branching program B is said to compute a function / if for every x G D n , the output of B on x, 
denoted B(x), is equal to f(x). A computation (in B) on x is a directed path, denoted ttb{x), from the 
source to a sink in B whose queries to the input are consistent with x. The time T of a branching program is 
the length of the longest path from the source to a sink and the space S is the logarithm base 2 of the number 
of the nodes in the branching program. Therefore, S > log T where we write log x to denote [log 2 from 
here on. 

A branching program B computes f under \i with error at most rj iff B(x) = f(x) for all but an 7]- 
measure of x G D n under distribution p,. A randomized branching program B is a probability distribution 
over deterministic branching programs with the same input set. B computes a function / with error at most 
r\ if for every input x G D n , Pr b~b[B{x) = f(x)] > 1 — rf. The time (resp. space) of a randomized 
branching program is the maximum time (resp. space) of a deterministic branching program in the support 
of the distribution. 

A branching program is leveled if the nodes are divided into an ordered collection of sets each called a 
level where edges are between consecutive levels only. Any branching program can be leveled by increasing 
the space S by an additive factor of log T. Since S > log T, in the following we assume that our branching 
programs are leveled. 

2 Frequency Moments over Sliding Windows 

We begin with our main lower bound for computing frequency moments over sliding windows and then 
derive a nearly matching upper bound. 
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2.1 A general sequential lower bound for F^ n and (F mod 2) ffln 

We derive a time-space tradeoff lower bound for randomized branching programs computing F^ n for k ^ 1. 
In fact, we show that the lower bound holds even when computing (Fo mod 2) ffln . (Note that the parity of 
Fk, k > 1 is exactly equal to the parity of n; thus the outputs of (Fk mod 2) ffln are all equal to n mod 2.) 

Theorem 2.1. For k = or k > 2. There is a constant 5 > smc/j ?/ia? a/ry [nj-way branching program of 
time T and space S that computes F^ n with error at most r], < r] < 1 — 2~ <55 ', for input randomly chosen 
uniformly from [n] 2n_1 must have T ■ S G f2(n 2 ). 77ie rame lower bound holds for (Fq mod 2) ffln . 

Corollary 2.2. Let k = or k > 2. 

• 77ze average time T and average space S needed to compute (Fk) mn (x) for x randomly chosen 
uniformly from [n] 2n_1 satisfies T ■ S G f2(n 2 ). 

• There is a 8 > smc/i that for < r/ < 1 — 2 _<55 , a«y r)-error randomized RAM or word-RAM 
algorithm computing (Ffc) ffln using time T and space S satisfies T ■ S G 0(n 2 ). 



Proof of Theorem 2.1 We derive the lower bound for Fq first. Afterwards we show the modifications 
needed for k > 2 and for computing (Fo mod 2) ffln . For convenience, on input x £ [n] 2n_1 , we write yi for 
the output F k (xi,..., x i+n -i). 

We use the general approach of Borodin and Cook [11] together with the observation of Q of how 
it applies to average case complexity and randomized branching programs. In particular, we divide the 
branching program B of length T into layers of height q each. Each layer is now a collection of small 
branching programs B' , each of whose start node is a node at the top level of the layer. Since the branching 
program must produce n outputs for each input x, for every input x there exists a small branching program 
B' of height q in some layer that produces at least nq/T > S outputs. There are at most 2 s nodes in B and 
hence there are at most 2 s such small branching programs among all the layers of B. One would normally 
prove that the fraction of x G [n] 2n_1 for which any one such small program correctly produces the desired 
number of outputs is much smaller than 2~ s and hence derive the desired lower bound. Usually this is done 
by arguing that the fraction of inputs consistent with any path in such a small branching program for which 
a fixed set of outputs is correct is much smaller than 2~ s . 

This basic outline is more complicated in our argument. One issue is that if a path in a small program B' 
finds that certain values are equal then the answers to nearby windows may be strongly correlated with each 
other; for example, if xi = Xi +n then yi = y%+\. Such correlations risk making the likelihood too high that 
the correct outputs are produced on a path. Therefore, we reason using the number of outputs from positions 
that are not duplicated in the previous window instead of the total number of outputs produced, arguing that 
with high probability there will be a linear number of such positions. 

A second issue is that inputs for which the value of Fo in a window happens to be extreme, say n - all 
distinct - or 1 - all identical - allow an almost-certain prediction of the value of Fo for the next window. We 
will use the fact that under the uniform distribution, cases like these almost surely do not happen; indeed the 
numbers of distinct elements in every window almost surely falls in a range close to their mean and in this 
case the value in the next window will be predictable with probability bounded below 1 given the value in 
the previous ones. In this case we use the chain rule to compute the overall probability of correctness of the 
outputs. 

We start by analyzing the likelihood that an output of Fq is extreme. 



6 



Lemma 2.3. Let a be chosen uniformly at random from [n] n . Then the probability that the output ofFo(a) 
is between 0.5n and 0.85n is at least 1 — 2e~ n / 50 . 

Proof. For a = a\ . . . a n uniformly chosen from [n] n , 

E[F (a)} = V Pr Bi G [n] such that a* = £} = nil - (1 - l/n) n ] . 

ie[n] 

Hence 0.632n < (1 - l/e)n < E[F (a)] < 0.75n. Define a Doob martingale D t , t = 0, 1, . . . , n with 
respect to the sequence a\ . . . a n by D t = E[i*o(a) | a\ . . . a t \. Therefore Dq = E[Fq(cl)] and D n = Fo(a). 
Applying the Azuma-Hoeffding inequality, we have 

Pr[F (a) i [0.5n,0.85n < Pr F o) - E F o I > O.ln] < 2e' 2 ^^ = 2e~ n/5 °, 

a a 

which proves the claim. □ 

Write W% to denote the contents, {x^, . . . , Xi +n -{\, of the i-th window, which is associated with output 
yi. We will also be interested in the number of positions j > n such that Xj ^ Wj- n . 

Lemma 2.4. Let x be chosen uniformly at random from [n] 2n_1 with n > 2. With probability at least 

1 - 4ne" n / 50 , 

(a) all outputs of F^ n (x) are between 0.5n and 0.85n, and 

(b) the number of positions j > n such that Xj £ Wj-n is at least n/24. 



Proof. We know from Lemma 2.3 and the union bound that part (a) is false with probability at most 



2ne n / 50 . To provide a similar tail bound for part (b) as well, we will need the following argument. 

Let the indicator variable Ij = 1 if Xj ^ Wj- n and otherwise. Similarly let Jj = 1 if Xj ^ 
{x±, . . . , Xj-i,Xj+i, . . . , X2n-i} and otherwise. Now let X = Y^=~n+i Ij ^ e tne number of positions 
j > n such that Xj ^ Wj- n and also define Y = Ylj=n+l ^ ^ s immediate that X < Y and hence a 
lower tail bound for Y will give us a lower tail bound for X. Assuming n > 2, we now have 

E(Y) = (n - 1)(1 - l/n) 2n ~ 2 > n/8. 

In order to provide the desired tail bound we observe that Y is the number of symbols in x that occur exactly 
once. This can be reformulated in terms of a classic balls and bins problem where the balls are the randomly 
chosen symbols of the input and bins are letters of the alphabet. In this terminology Y represents the number 
of bins that contain exactly one ball. Following the approach taken to this problem in ifTTl , it follows that 
the random variables Jj are negatively associated and hence that Y is more closely concentrated around 
its mean than if they were fully independent. It also therefore follows that we can apply a Chernoff bound 
directly to our problem, giving 

*(*<£)<p(r<£)<^<^)<.-* 

We now get the desired bound for parts (a) and (b) combined by another application of the union bound. □ 
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Correctness of a small branching program for computing outputs in unique positions Let B' be an 

[n]-way branching program of height q and let 7r be a path in B' with queries Q G [2n — 1]. An index £ > n 
is said to be unique if xg £ Wi- n D Q. In order to measure the correctness of the small branching program, 
we will restrict out attention to outputs that are produced at positions that are unique. We now upper-bound 
the probability that a small branching program correctly computes outputs of F® n at many unique positions 
in the input. 

Let £ be the event that all outputs of F® n (x) are between 0.5ra and 0.85n. 

Lemma 2.5. Let r > be a positive integer, let e < 1/10, and let B' be an [n]-way branching program of 
height q = en. Let tt be a path in B' with queries Q on which outputs from at least r unique positions are 
produced. Then for random x uniformly chosen from [n] 2n , 

Pr [these r outputs are correct for F^ n (x), £\ttb>(x) = vr] < (17/18) r . 

Proof. Let Zi x , . . . , zi r be the outputs from unique positions produced by B' along tt, and let Q and A be 
the queries and the answers to those queries along n. We upper bound the probability that all of = 

yi r = z ir hold. 

For output yi, we call X{ the left border of the window Wj and call Xj +n _i its right border. We compute 
the difference between two consecutive outputs ?/, and yi-\. If the left border Xi-i of appears in the 
i th window W{, then its contribution to the value of y^ is the same as for y; L -\ \ otherwise we should decrease 
y%-\ by 1 to get y^. Similarly, if the right border Xj +n _i of y^ appears in the {i — l) st window W{-i, it has 
the same contribution to yi and y%-\ \ otherwise, we need to increase yi-\ by 1 to get y; L . Formally we have 

In order to compute the correctness of the outputs yi 1 ,y% 2 , ■ ■ ■ ■, V% r , we apply the chain rule: 

Pr [y h = z h , . . . ,y ir = z ir , £\ir B >(x) = n] 

r 

= II Pr ^ = £ \ y n = z h, ■ • • i Vh-i = z h-n KB'(x) = tt] (2) 

Revealing new information In order to bound the individual probabilities in equation we reveal for 
each ji the following information: 

• the value of the indicator \{ x . _^Wj }> an d 

• the value of the output yj t -\. 
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We denote by £i-\ the event {yj 1 
the following in equation ([2]): 



'in 



>Vit 



jt- 



1 }. The information we reveal corresponds to 



Pr [y 



'Jt 

n 



Zjt > £\£( 



-i,n B '(x) 



Vk 



= E E ft 

m=l fee{o,i} 

x Pr [s&-i 

< max Pr 

me[0.5n,0.85n] 
6e{0,l} 

< max Pr 

me[0.5n,0.85n] 
6e{0,l} 



7T 



m. 



ife-i^J = b,£e-i,WB'(x) = vr 



m, 1 



Zj t , £ 



~- 6, £ £^_l,7TB/(x) = 7T 



7T 



where the penultimate inequality follows since the event £ can only hold if is between 0.5ra and 0.85n. 

We compute the output yj e based on the values of the outputs produced by B' and the information we 
reveal. Since the outputs , yj 2 , . . . , yj e are all produced at unique positions, the answers in A do not give 
any information about the occurrences of Xj l+n ~\ in Wj t -\. Moreover, yj e takes values in {m, m+1, m— 1} 
depending on the values of the indicators in equation ([T]). 

First observe that if Xj e+n -\ is not queried, then the conditioning ttb'{x) = it and £i-\ has no effect 
on it and x^ +n _i is uniformly chosen from [n] and the probability that it equals any one of the m distinct 
values appearing in the window Wj t -\ is precisely m/n. 

Now assume that Xj e + n -i is queried. Wj e -i contains precisely m distinct values, out of which there 
can be at most q values appearing in the answers A to the queries in Q. Even if the queried positions are 
not in this window, the values of y 5l , . . . , yj t _ 1 may imply that some of the values appearing in the answers 
A occur in Wj { -\ and some do not. Let q' < q denote the number that are guaranteed to appear in Wj t -\ 
and q" < q — q' denote the number that are guaranteed not to appear in Wj { -\. Note that by the uniqueness 
of Xj e+n ~i, Xj e+n -i does not contribute to either q' or q". Let W be the set of m — q' remaining distinct 
values taken at unqueried positions in Wj t —\. Since the input distribution is uniform, by symmetry, any 
set W consistent with the above constraints is equally likely and hence W is a uniformly chosen set of 
m — q' distinct elements from an alphabet S of size n — q" which contains Xj {+n -\. By symmetry, the 
probability that the random set W C S contains Xj e+n ^i is the same as the probability that a random 
element s = Xj e+n -\ G E is contained in W C E which is precisely (m — q 1 ) I in — q"). 

For a € { — 1, 0, 1} and c G {0, 1}, we have 



Pr 



Vjt 



m + a 



Vji-i 



Pr 



L {.f 



j(+n — l\ 



Wi 



m ^{x k ^W k } = b,£e-i,TT B '(x) = vr 

Vh-i = m ' 1 {^ / -i0W}.} = b,££-i,ir B '(x) 



TT 



where c is given by the following: c = if a = — 1, 1 if a = 1, and b if a = 0. Therefore the probability 
above reduces to the probability of Xj l+n -\ appearing (c = 1) or not appearing (c = 0) in Wj t -\. Since 



Pr [s W] < 1 

upper bounds in equation Q, we get: 



n 



q = en, we have Pr [s G W] < ^ < < ^f^y < 0.85/(1 - e) < 17/18 since e < 1/10. Similarly, 

0.5n—en 



< 0.5 + e < 0.6 since e < 1/10. Plugging in the larger of these 



Pr [z h ,. ..,z ir are correct for F ffln (x), £\tt b >(x) = vr] < (17/18) r , 
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which proves the lemma. 



□ 



Putting the Pieces Together We now combine the above lemmas. Suppose that TS < n 2 /4800 and let 
q = n/10. We can assume without loss of generality that 5 > log n since we need T > n to determine even 
a single answer. 

i-'" -1 on which B correctly computes Fq 



2.4 



For 



Consider the fraction of inputs in [n] on which B correctly computes Fq. By Lemma 
input x chosen uniformly from [n] 2 ™" 1 , the probability that £ holds and x has at least n/24 outputs at 
unique positions is at least 1 — 4ne -n / 50 . Therefore, in order to be correct on any such x, B must correctly 
produce outputs from at least n/24 outputs at unique positions in x. 

For every such input x, by our earlier outline, one of the 2 s [n]-way branching programs B' of height q 
contained in B produces at least r = [n/24]q/T > 205 correct outputs at unique positions of F^ n (x) of 
input x. 

Therefore, for all but a 4ne~ n / 50 fraction of inputs x on which B is correct, £ holds for x and there is 
one of the < 2 s branching programs B' in B of height q such that the path tt = ttb' (%) produces at least 
205 outputs at unique positions in x that are correct for x. 



Consider a single such program B'. By Lemma 2.5 for any path tt in B', the fraction of inputs x such 
that ttb'(x) = tt for which 205* of these outputs are correct for x and produced at unique positions, and £ 
holds for x is at most (17/18) 205 < 3~ s . By Proposition 2.4 this same bound applies to the fraction of 
all inputs x with ttb'(x) = tt for which 205 of these outputs are correct from x and produced at unique 
positions, and £ holds for x is at most (17/18) 205 < 3~ s . 

Since the inputs following different paths in B' are disjoint, the fraction of all inputs x for which £ holds 
and which follow some path in B' that yields at least 205 correct answers from distinct runs of x is less than 
3~ s . Since there are at most 2 s such height q branching programs, one of which must produce 205 correct 
outputs from distinct runs of x for every remaining input, in total only a 2 S 3~ S = {2/3) s fraction of all 
inputs have these outputs correctly produced. 

In particular this implies that B is correct on at most an 4ne _n//50 + (2/3) s fraction of inputs. For n 
sufficiently large this is smaller than 1 — rj for any rj < 1 — 2~ ss for some 5 > 0, which contradicts our 
original assumption. This completes the proof of Theorem 2.1 □ 



Lower Bound for Fjr n , k > 2 We modify the proof of Theorem 2.1 to prove the claim for integer k > 2 



We use the same hard distribution for the input, where the number of distinct numbers in each sliding window 
is within the range [0.5n, 0.85n] with high probability. We follow the same analysis for the correctness of 
a path tt in a branching program, along which at least r outputs from r unique positions are produced. The 
main difference is in computing the difference between consecutive outputs: 



Vi ~ Vi-i 



+i 



Si 



(t-i) 



+ 



(t-1) 

1 



1 



Ai-\) 

JXi-X 



(3) 



If Xi+n-i does not appear in the (i — l) st window {xi-\, . . . , Xj+ n _2}, then the first term's contribution to 
the sum is exactly 1, otherwise it is strictly bigger than 1. Similarly, if x%-\ does not appear in that window, 
then the second term's contribution to the sum is exactly -1, otherwise it is strictly smaller than -1. 

In order to compute the probability of correctness an output yj e , we reveal the following information (in 
the same fashion we did in the case k = 0): 



the value of the output right before the £ th output, Vj e ~i 
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• the number of distinct elements in the (ji — l) st window {xj t -i, . . . , Xj l+n -2], Fq 1 1 \ 

• the frequency of the left border Xj t -\ in the window Wj t -\, f^/S^- 

Denoting the event {yj e -i = M,F^ e ^ = m, f^i.\ = Jl, for some values M E N, m E [n] and 
/l E [n] by £%, we upper bound the probability of correctness of yj e by: 

Pr[y jt = Zj v £\£z,£i-i,-x B >(x) = w] < max Pr [y je = z je \£ x ,£e-i,iT B >(x) = it] 

x mg[0.5n.,0.85n] x 

M,f L 

We write the probability of correctness of yj e in terms of the difference in Equation [3] 

max Pr [y je = z je \£ Xl £e~i, k b > (x) = n] 
m,M,fL x 

= max Pr [y je - j/,- i = z je - M\£ x , tt b >(x) = n] 
m,M,jL,fa 

= + l ^ ~ + {fL-l) k -f k L = Z n -M | 



= max Pr 

m, M, f L 

< max Pr 

m, M, f L 



£ X ,£e-l,TT B '(x) = 7r] 



l{i j(+ „_ieiyj r i} 



£z,£l-l,K B >{x) = 7T 

fx, £g-i,Tr B >(x) = 7r for some c E {0, 1} 



where h(f L , Zj t ,M) is the integer solution to the equation: (/ + l) k - f k = z je - M - (f L - l) k - f k . 
This equation was derived by applying equation ([3]) to the difference of the two consecutive outputs yj e and 
yj e -i and replacing all the revealed variables by their values. The last inequality follows from the fact that 

fxf ( +n-i = iff 1{ccj +n _ieWj _i} = 0- This last probability is upper bounded by 17/18 for both values of 
c as in the analysis of the case k = 0. The rest of the proof follows. 

Lower bound for (Fq mod 2) ffln We prove a time-space tradeoff TS € Vt{n 2 ) for the problem of com- 
puting the parity of Fq for each sliding window, (Fq mod 2) ffln . We describe how to modify the proof of 
Theorem 12. II to derive the lower bound. 

Using the same hard distribution on the inputs, we guarantee that Fq for all windows takes values in the 
range [0.5ra, 0.85n]. When the input is chosen at random from [n] 2n_1 , this happens with high probability, 



as we showed in Lemma 2.4 Moreover, under the same distribution, a linear fraction of the outputs will be 



produced at unique positions as shown in Lemma 2.4 



We follow the same technique of dividing the branching program into layers of height q = en and then 
computing the correctness of a path it in a small branching program in computing r outputs produced at r 
unique positions. 

To upper bound the probability of correctness of the r outputs, we apply the chain rule and we reveal 
information in the same fashion as we did for F^ n : for an output yj t of (Fq mod 2) ffln (taking values in 
{0, 1}), we reveal: 

• the value of the indicators 1^ _ x gWj }> denoted by b E {0, 1} and 

• the value of Fq* *\ the number of distinct elements in the window of the output right before yj e . 
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Therefore, the probability of correctness of yj e is upper bounded by: 



max Pr 

me[0.5n,0.85n] 

6e{o,i} 



F, 



m ^{=0 jf -^W jf } = b,£l,lT B ,(x) = 7T 



Therefore, for a, c G {0, 1} the probability of correctness of yj e is given by (we drop the range of m in 
the following for convenience): 



max Pr 

m, b 

maxPr 

m, b 

max Pr 

m, b 



Vk 



Zj e , £ 



Vk =a, £ 



r 



Hxj.-igWjA = b,£e,TT B '(x) = vr 



o 



m ^{x 3e ^w 3e } = b,£e,ir B '(x) = vr 



j^ + n — 15 



c, £ 



F, 



ik-i) 



m,l {a 



,} = b,£e,TT B '(x) = IT 



where c is given by the following: c = 6 if a = m mod 2 and 1 — 6 otherwise. Hence the probability 
of correctness of yj e can be upper bounded by 17/18 as we show in the proof of Theorem 
remaining steps of the proof follow to yield a time-space tradeoff T • S G Q,(n 2 ). 



2.1 



and the 



2.2 A time-space efficient algorithm for Fj~ n 

We now show that the above time-space tradeoff lower bound is nearly optimal even for restricted RAM 
models. 

Theorem 2.6. There is a comparison-based deterministic RAM algorithm for computing F^ n for any fixed 
integer k > with time-space tradeoff T ■ S G 0(n 2 log 2 n)for all space bounds S with log n < S < n. 

Proof. We denote the i-th output by yi = Fk(xi, . . . , Xj +n _i). We first compute y\ using the comparison- 
based time 0(n 2 /S) sorting algorithm of Pagter and Rauhe |[23l . This algorithm produces the list of outputs 
in order by building a space S data structure D over the n inputs and then repeatedly removing and returning 
the index of the smallest element from that structure using a POP operation. We perform POP operations 
on D and keep track of the last index popped. We also will maintain the index i of the previous symbol seen 
as well as a counter that tells us the number of times the symbol has been seen so far. When a new index j 
is popped, we compare the symbol at that index with the symbol at the saved index. If they are equal, the 
counter is incremented. Otherwise, we save the new index j, update the running total for using the /c-th 
power of the counter just computed, and then reset that counter to 1. 

Let S' = S/logn. We compute the remaining outputs in n/S' groups of S' outputs at a time. In 
particular, suppose that we have already computed y^. We compute j/i+i, . . . , Vi+s' as follows: 

We first build a single binary search tree for both Xi, . . . , Xi + s'-i and for Xi+ n , . . . , Xi +n+ s'-i and 
include a pointer p(j) from each index j to the leaf node it is associated with. We call the elements 
Xi, . . . , Xi + s'-i the old elements and add them starting from x i+ s'-i- While doing so we maintain a counter 
Cj for each index j G [i, i + S' — 1] of the number of times that Xj appears to its right in x^, . . . , X{ + s'-i- 
We do the same for Xi +n , . . . , Xi +n+ s>-i, which we call the new elements, but starting from the left. For 
both sets of symbols, we also add the list of indices where each element occurs to the relevant leaf in the 
binary search tree. 

We then scan the n — S' elements Xj+s', . . . , Xi+ n -i and maintain a counter C(£) at each leaf I of each 
tree to record the number of times that the element has appeared. 
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For j e [i, i + S' — 1] we produce yj+i from yj. If Xj = Xj +n then yj+i = yj. Otherwise, we can use 
the number of times the old symbol Xj and the new symbol Xj +ri occur in the window Xj+i, . . . , Xj+n—i to 
give us yj+i- To compute the number of times Xj occurs in the window, we look at the current head pointer 
in the new element list associated with leaf p(j) of the binary search tree. Repeatedly move that pointer to 
the right if the next position in the list of that position is at most n + j — 1. Call the new head position 
index I. The number of occurrences of Xj in Xj+i, . . . , xgi and x n+ ±, . . . , x n+ j is now Cj + q. The head 
pointer never moves backwards and so the total number of pointer moves will be bounded by the number of 
new elements. We can similarly compute the number of times Xj +n occurs in the window by looking at the 
current head pointer in the old element list associated with p(j + n) and moving the pointer to the left until 
it is at position no less than j + 1. Call the new head position in the old element list £'. 

Finally, for k > we can output yj + i by subtracting (1 + Cj + q + C(p(j))) k — (cj + C£ + C(p(j)) k 
fromyj and adding (1 + Cj +n + C£> +C(p(j + n))) k — (cj +n + C£i + C(p(j + n)) k . When A; = we compute 
y j+ i by subtracting the value of the indicator l Cj+Ce+ c(p(j))=o from Vj and adding l Cj+n+Ci , +C (p(j+n))=o- 

The total storage required for the search trees and pointers is 0(S' log n) which is O(S). The total time 
to compute . . . , yi+s' is dominated by the n — S' increments of counters using the binary search tree, 
which is 0(n\og S') and hence 0(nlogS) time. This computation must be done (n — l)/S' times for a 
total of 0{"^^) time. Since S' = S/ log n, the total time including that to compute yi is 0( " 2 log ^ log5 ) 
and hence T ■ S £ 0{n 2 log 2 n). □ 



3 Element Distinctness is easier than F mod 2 

In this section we investigate the complexity of ED mn and show that it is strictly easier than (Fq mod 2) ffln . 
This fact is established by giving a particularly simple errorless algorithm for ED mn which runs in linear 
time on average on random inputs with alphabet [n] and hence also beats our strong average-case lower 
bound for (F mod 2) mn . 

We also give a deterministic reduction showing that the complexity of ED mn is very similar to that of 
ED. In particular, ED mn can be computed with at most an 0(log 2 n) additive increase in the space and 
0(log n) multiplicative increase in time more than required to compute a single instance of ED. This 
shows that any deterministic or randomized algorithm for ED that satisfies T ■ S G o(n 2 / log 3 n) would 
provide a worst-case separation between the complexities of ED mn and (Fq mod 2) ffln . 

3.1 A fast average case algorithm for ED mn with alphabet [n] 

We show a simple average case 0-error sliding- window algorithm for ED mn . When the input alphabet is 
chosen uniformly at random from [n], the algorithm runs in 0(n) time on average using O(logn) bits of 
space. By way of contrast, in Section^ we proved an average case time-space lower bound of T ■ S G 0(n 2 ) 
for (Fq mod 2) ffl " under the same distribution. 

The method we employ is as follows. We start at the first window of length n of the input and perform 
a search for the first duplicate pair starting at the right-hand end of the window and going to the left. We 
check if a symbol at position j is involved in a duplicate by simply scanning all the symbols to the right of 
position j within the window. If the algorithm finds a duplicate in a suffix of length x, it shifts the window 
to the right by n — x + 1 and repeats the procedure from this point. If it does not find a duplicate at all in 
the whole window, it simply moves the window on by one and starts again. 
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In order to establish the running time of this simple method, we will make use of the following birthday- 
problem-related facts. 

Lemma 3.1. Assume that we sample i.u.d. with replacement from the range {1 . . . n}. Let X be a discrete 
random variable that represents the number of samples taken when the first duplicate is found. Then 

Pr(X > n/2) < e~^. (4) 
and there exists an absolute constant B > such that 

E(X 2 ) < Bn. (5) 

Proof. For any x we have 

X—l / . \ x—1 



x — 1 / . \ x — ± 

Pr(X>z) = n(l--M <II e "" ^ e 

i=l ^ ' i=l 



Inequality (|4]) follows by substituting x = n/2 which yields Pr (X > ^) < e s . To prove inequality Q, 
recall that for non-negative valued discrete random variables 

oo 

EpT) = J2 Fr ( x ^ x )- 



x=l 



Observe that 



E(X 2 ) = jr Pr(X 2 > s) = Pr (^ > V5) 



x=l x=l 



x=l 



OO 2 

e 2n < e 2n rfx. 



x=0 



The final integral is bounded above by a constant multiple of n and so there exists a B such that E(X 2 ) < 
Bn. □ 

We can now show the running time of our average case algorithm for ED mn . 

Theorem 3.2. Assume that the input is sampled i.u.d. with replacement from alphabet [n\. ED mn can be 
solved in T G 0(n) time on average and space S G 0(log n) bits. 

Proof. Let U be a sequence of values sampled uniformly from [n] . Let M be the index of the first duplicate 
in U found when scanning from the right and let X = n — M. Let W(X) be the number of comparisons 
required to find X. Using our naive duplicate finding method we have that W(X) < X( X + l)/2. It also 
follows from inequality (|5]> that E(W) < Bn. 

Let R(n) be the total running time of our algorithm and note that R(n) < n 3 /2. Furthermore the 
residual running time at any intermediate stage of the algorithm is at most R(n). 

Let us consider the first window and let M\ be the index of the first duplicate from the right and let 
X\ = n — Mi. If X\ > n/2, denote the residual running time by We know from Q that Pr(Xi > 

n — 4 

n/2) < e ~s~. If X\ < n/2, shift the window to the right by Mi + 1 and find X2 for this new window. If 
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X% > n/2, denote the residual running time by R( 2 \ We know that Pr(X 2 > n/2) < e s . If X\ < n/2 
and X2 < n/2 then the algorithm will terminate, outputting 'not all distinct' for every window. 
The expected running time is then 



E(R{n)) = E (W(Xi)) + E (rF>) Pr (X x > 



n 



+ Pr (Xx < |) [f? (w(Jf 2 )|*i < |) + E (r^) Pr (x 2 > ^\x, < |) 

„ n 3 ^ n 3 . 

< Bn+ye s + £ n +_ e 8 eO(n) 

The inequality follows from the followings three observations. We know trivially that Pr(Xi < n/2) < 
1. Second, the number of comparisons W{X2) does not increase if some of the elements in a window are 
known to be unique. Third, Pr(X 2 > n/2 A X x < n/2) < Pv(X 2 > n/2) < e'^. □ 

We note that similar results can be shown for inputs uniformly chosen from the alphabet [cn] for any 
constant c. 

3.2 Sliding windows do not significantly increase the complexity of element distinctness 

As preparation for the main results of this section, we first give a deterministic reduction which shows 
how the answer to an element distinctness problem allows one to reduce the input size of sliding-window 
algorithms for computing ED^ m . 

Lemma 3.3. Let n > m > 0. 

(a) If ED n _ m+1 (x m , ...,x n )=0 then ED® m ( Xl , . . . ,x n+m _i) = m . 

(b) IfED n - m+ i(x m , ...,x n ) = l then define 

i. %]_, = max{j £ [m - 1] I ED n ^j + i(xj, . . . , x n ) = 0} where %l = if the set is empty and 

ii. ir = min{j £ [m — 1] | ED n ^ m+ j(x m , . . . , x n +j) = 0} where ir = m if the set is empty. 

Then 

ED® m (x 1 ,...,x n+m - 1 ) = iL l m - iL A l'KO™-** A ED®™ 1 (x 1 ,...,x m - 1 ,x n+1 ,...,x n+m - 1 ) 

where each A represents bit-wise conjunction. 

Proof. The elements M = (x m , ...,x n ) appear in all m of the windows so if this sequence contains 
duplicated elements, so do all of the windows and hence the output for all windows is 0. This implies part 
(a). 

If M does not contain any duplicates then any duplicate in a window must involve at least one element 
from L = (xi, . . . , x m -\) or from R = (x n+ i, . . . , x n+m -\). If a window has value because it contains 
an element of L that also appears in M, it must also contain the rightmost such element of L and hence 
any window that is distinct must begin to the right of this rightmost such element of L. Similarly, if a 
window has value because it contains an element of R that also appears in M, it must also contain the 
leftmost such element of L and hence any window that is distinct must end to the left of this leftmost such 



15 



element of R. The only remaining duplicates that can occur in a window can only involve elements of both 
L and R. In order, the m windows contain the following sequences of elements of L U R: (x\, . . . , x m -\), 
(x 2 , ■ ■ .,x m -i,x n+ i), . . ., (x m -i,x n+ i, . . . ,x n+m _ 2 ), (x n+ i, . . .,x n+m -i). These are precisely the se- 
quences for which ED^ix!,... , x m -i, x n +i, ■ ■ ■ , x n + m -i) determines distinctness. Hence part (b) fol- 
lows. □ 

We use the above reduction in input size to show that any efficient algorithm for element distinctness 
can be extended to solve element distinctness over sliding windows at a small additional cost. 

Theorem 3.4. If there is an algorithm A that solve element distinctness, ED, using time at most T(n) and 
space at most S(n), where T and S are nondecreasing functions of n, then there is an algorithm A* that 
solves the sliding-window version of element distinctness, ED^ n , in time T* (n) that is 0(T(n) log 2 n) and 
space S*(n) that is 0(S(n) + log 2 n). Moreover, ifT(n) is 0(n l3 )for (3 > 1, then T*(n) is 0(vP logn). 

If A is deterministic then so is A*. If A is randomized with error at most e then A* is randomized with 
error o(l/n). Moreover, if A has the obvious I -sided error (it only reports that inputs are not distinct if it is 
certain of the fact) then the same property holds for A*. 

Proof. We first assume that A is deterministic. Algorithm A* will compute the n outputs of ED^ n in n/m 



groups of m using the input size reduction method from Lemma 3.3 In particular, for each group A* will 
first call A on the middle section of input size n — m + 1 and output m if A returns 0. Otherwise, A* will 
do two binary searches involving at most 2 log to calls to A on inputs of size at most n to compute i l and 
ijl as defined in part (b) of that lemma. Finally, in each group, A* will make one recursive call to A* on a 
problem of size to. 

It is easy to see that this yields a recurrence of the form 

T*(n) = (n/m)\cT(n) log to + T*(m)]. 

In particular, if we choose m = n/2 then we obtain T*(n) < 2T*(n/2) + 2cT(n) logn. If T(n) is 0(r/) 
for /3 > 1 this solves to T*(n) = 0(n^ logn). Otherwise, it is immediate from the definition of T[n) that 
T(n) must be Q(n) and hence the recursion for A* has O(logn) levels and the total cost associated with 
each of the levels of the recursion is 0(T(n) log n). 

Observe that the space for all the calls to A can be re-used in the recursion. Also note that the algorithm 
A* only needs to remember a constant number of pointers for each level of recursion for a total cost of 
0(log 2 n) additional bits. 

We now suppose that the algorithm A is randomized with error at most e. For the recursion based on 



Lemma 3.3 we use algorithm A and run it C = 0(log n) times on input (x m , . . . , x n ), taking the majority 
of the answers to reduce the error to o(l/n 2 ). In case that no duplicate is found in these calls, we then 
apply the noisy binary search method of Feige, Peleg, Raghavan, and Upfal lPT8l to determine il and iji 
with error at most o(l/ n 2 ) by using only C = 0(log n) calls to A. (If the original problem size is n we will 
use the same fixed number C = O(logn) of calls to A even at deeper levels of the recursion so that each 
subproblem has error o(l/n 2 ).) There are only O(n) subproblems so the final error is o(l/n). The rest of 
the run-time analysis is the same as in the deterministic case. 

If A has only has false positives (if it claims that the input is not distinct then it is certain that there is a 
duplicate) then observe that A* will only have false positives. □ 
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4 Order Statistics in Sliding Windows 



We first show that when order statistics are extreme, their complexity over sliding windows does not signif- 
icantly increase over that of a single instance. 

Theorem 4.1. There is a deterministic comparison algorithm that computes MAX^ n (equivalently MIN^ n ) 
using time T G 0(n log n) and space S G O(logn). 

Proof. Given an input x of length 2n — 1, we consider the window of n elements starting at position [~^] 
and ending at position n + [S] — 1 and find the largest element in this window naively in time n and space 
O(logn); call it m. Assume without loss of generality that m occurs between positions and n, that is, 
the left half of the window we just considered. Now we slide the window of length n to the left one position 
at a time. At each turn we just need to look at the new symbol that is added to the window and compare it 
to m. If it is larger than m then set this as the new maximum for that window and continue. 

We now have all outputs for all windows that start in positions 1 to For the remaining outputs, 
we now run our algorithm recursively on the remaining n + [f]-long region of the input. We only need 
to maintain the left and right endpoints of the current region. At each level in the recursion, the number of 
outputs is halved and each level takes O(n) time. Hence, the overall time complexity is 0{n log n) and the 
space is O (log n). □ 

In contrast when an order statistic is near the middle, such as the median, we can derive a significant sep- 
aration in complexity between the sliding- window and a single instance. This follows by a simple reduction 
and known time-space tradeoff lower bounds for sorting lfTT1 l8Tl. 

Theorem 4.2. Let P be a branching program computing 0^ n in time T and space S on an input of size 
2n — 1, for any t G [n]. Then T • S G Q(t 2 ) and the same bound applies to expected time for randomized 
algorithms. 

Proof. We give lower bound for Of n for t G [n] by showing a reduction from sorting. Given a sequence s 
of t elements to sort taking values in {2, . . . , n — 1}, we create a 2n — 1 length string as follows: the first 
n — t symbols take the same value of n, the last n — 1 symbols take the same value of 1 and we embed the t 
elements to sort in the remaining t positions, in an arbitrary order. For the first window, O t is the maximum 
of the sequence s. As we slide the window, we replace a symbol from the left, which has value n, by a 
symbol from the right, which has value 1 . The t th smallest element of window i = 1, . . . , t is the i th largest 
element in the sequence s. Then the first t outputs of Of n are the t elements of the sequence s output in 
increasing order. The lower bound follows from ifTTl l8l. As with the bounds in Corollary 2.2 the proof 



methods in IfTTl [8| also immediately extend to average case and randomized complexity. □ 

For the special case t = \¥f\ (median), we note that the best lower bound known for the single-input 
version of the median problem is T G fi(n log log 5 n) derived in lfT5l for S G w(log n) and this is tight for 
the expected time of errorless randomized algorithms. 
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